🧮 Compute Optimization - matmat · Scour

🚀Compiler Optimizations hiraditya.github.io·

Loop Unrolling in the ML Era

Discussed on Hacker News

⚡SIMD Optimization shnatsel.medium.com·

Safe SIMD in Rust, even on the inside

Discussed on Hacker News and Lobsters

🚀SIMD Text Processing extractingcycles.com·

The IPv4 Parser AI Couldn't Have Written

Covers Compiler Explorer

Discussed on Hacker News

⚡SIMD Optimization zeux.io·

Zigzag decoding with AVX-512

Covers uops.info

Discussed on Hacker News

🔒Type Safety Phoronix·

Rust PNG Image Decoder Now Even Faster: Benefiting Chrome, GNOME, Etc

⚡Parallel Computing indianspeedster.github.io·

Occupancy Math on the AMD MI355X: A From-First-Principles Guide

Discussed on Hacker News, Hacker News, and Hacker News

🖥️Hardware Architecture Sylvain Kerkour·

Hashing at 130 GB/s with XXH3, Rust and SIMD instructions on AMD Zen 5

Discussed on Hacker News

🧩RISC-V Assembly atticarun.itch.io·

Foundry-5: browser puzzle game that teaches you real RISC-V assembly

Discussed on Hacker News

🔒Type Safety blog.image-rs.org·

Rust PNG crate gets even faster, used by GNOME and Chromium

Covers google/oss-fuzz

Covered by Phoronix

Discussed on Hacker News

🎯Retrieval Systems arxiv.org·

MonaVec: A Training-Free Embedded Vector Search Kernel for Edge and Offline AI Systems

Covers Easy way to do both: async <-> sync (crates.io dump loading and parsing example)

⚡Parallel Computing Akin Ocal·

Building a High-Throughput FIX Server

Discussed on Substack

🚀Indie Hacking GitHub·

Blaise v0.11.0 Is Here

Discussed on Hacker News

⚡Parallel Computing hackernoon.com·

Why Speed Matters: How Performance in Analytics Saves Business from "Digital Paralysis"

💨Cache Optimization Phoronix·

Intel Performance Skills: New Open-Source Project Leveraging AI For Linux Performance Optimizations

📐Linear Algebra arxiv.org·

Evaluating Rust for Sparse Matrix Kernels in Scientific Computing

⚡Parallel Computing GitHub·

Symbolica: high-performance computer algebra library for Python and Rust

Discussed on Lobsters

⚡Parallel Computing arxiv.org·

Diagonal-Budgeted Trotterization for Efficient Quantum Hamiltonian Simulation

📊Vector Quantization GitHub·

RunEdgeAI/turboquant.cpp: Near-optimal online vector quantization in C++23 — 1-4 bits per coordinate, no training, no codebooks

Covers TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate

Discussed on Hacker News

💻Operating System, OS arxiv.org·

Mojo: A Promising Tool for Scalable Financial AI Efficiency

🔒Type Safety GitHub·

Rust port of transformers (1M lines of code)

Discussed on Hacker News

No more posts from matmat's subscribed feeds.

Scour all 25,324 feeds Learn more about Feeds

Log in to enable infinite scrolling